Preserving medical correctness, readability and consistency in de-identified health records

نویسندگان

  • Kostas Pantazos
  • Søren Lauesen
  • Søren Lippert
چکیده

A health record database contains structured data fields that identify the patient, such as patient ID, patient name, e-mail and phone number. These data are fairly easy to de-identify, that is, replace with other identifiers. However, these data also occur in fields with doctors' free-text notes written in an abbreviated style that cannot be analyzed grammatically. If we replace a word that looks like a name, but isn't, we degrade readability and medical correctness. If we fail to replace it when we should, we degrade confidentiality. We de-identified an existing Danish electronic health record database, ending up with 323,122 patient health records. We had to invent many methods for de-identifying potential identifiers in the free-text notes. The de-identified health records should be used with caution for statistical purposes because we removed health records that were so special that they couldn't be de-identified. Furthermore, we distorted geography by replacing zip codes with random zip codes.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

De-identifying an EHR Database - Anonymity, Correctness and Readability of the Medical Record

Electronic health records (EHR) contain a large amount of structured data and free text. Exploring and sharing clinical data can improve healthcare and facilitate the development of medical software. However, revealing confidential information is against ethical principles and laws. We de-identified a Danish EHR database with 437,164 patients. The goal was to generate a version with real medica...

متن کامل

ارزیابی کیفیت و بررسی میزان خوانایی اطلاعات سلامت برخط تولید شده توسط وزارت بهداشت، درمان و آموزش پزشکی ایران

Introduction: The Ministry of Health and Medical Education is one of the main providers of health information for patients and caregivers in Iran. The current study aimed to assess the quality and readability of online health information produced by the Ministry and its affiliated organizations. Methods: In this descriptive-survey study, the websites of the Ministry of Health and its affiliate...

متن کامل

بررسی میزان صحت کدگذاری در بیمارستانهای آموزشی دانشگاه علوم پزشکی و خدمات بهداشتی درمانی شیراز

The research was intended to determine the rate of coding accuracy in the training hospitals of Shiraz University of Medical Sciences and Health Treatment Services in 1995 (1374), and it was performed through a descriptive-analytic method. In the research, 400 medical records were selected based on stratified sampling method from among records of the patients having been discharged from hospita...

متن کامل

Beyond Surface Characteristics: A New Health Text-Specific Readability Measurement

Accurate readability assessment of health related materials is a critical first step in producing easily understandable consumer health information resources and personal health records. Existing general readability formulas may not always be appropriate for the medical/consumer health domain. We developed a new health-specific readability pilot measure, based on the differences in semantic and...

متن کامل

Assessing the Readability of Patient Education Materials about Diabetes Available in Shiraz Health Centers

Introduction: Patient education materials are one of the important factors to improve the health literacy of patients with chronic diseases like diabetes and are employed in order to develop self-care skills. These materials will meet such objectives if they are understandable by their audiences. Hence, the aim of present study was to evaluate the readability of educational resources published ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Health informatics journal

دوره 23 4  شماره 

صفحات  -

تاریخ انتشار 2017